Using Spectro-Temporal Features for Environmental Sounds Recognition
نویسندگان
چکیده
The paper presents the task of recognizing environmental sounds for audio surveillance and security applications. A various characteristics have been proposed for audio classification, including the popular Mel-frequency cepstral coefficients (MFCCs) which give a description of the audio spectral shape. However, it exist some temporal-domain features. These last have been developed to characterize the audio signals. Here, we make an empirical feature analysis for environmental sounds classification and propose to use the log-Gabor-filters algorithm to obtain effective time-frequency characteristics. The LogGabor filters-based method utilizes time-frequency decomposition for feature extraction, resulting in a flexible and physically interpretable set of features. The Log-Gabor filters-based feature is adopted to supplement the MFCC features to yield higher classification accuracy for environmental sounds. Extensive experiments are performed to prove the effectiveness of these joint features for environmental sound recognition. Besides, we provide empirical results showing that our method is robust for audio surveillance Applications.
منابع مشابه
Phoneme Classification Using Temporal Tracking of Speech Clusters in Spectro-temporal Domain
This article presents a new feature extraction technique based on the temporal tracking of clusters in spectro-temporal features space. In the proposed method, auditory cortical outputs were clustered. The attributes of speech clusters were extracted as secondary features. However, the shape and position of speech clusters change during the time. The clusters temporally tracked and temporal tra...
متن کاملA phoneme recognition framework based on auditory spectro-temporal receptive fields
We propose to incorporate features derived using spectrotemporal receptive fields (STRFs) of neurons in the auditory cortex for phoneme recognition. Each of these STRFs is tuned to different auditory frequencies, scales and modulation rates. We select different sets of STRFs which are specific for phonemes in different broad phonetic classes (BPC) of sounds. These STRFs are then used as spectro...
متن کاملTimbre Recognition with Combined Stationary and Temporal Features
In this paper we consider the problem of modeling spectro-temporal behaviour of musical sounds, with applications for musical instrument recognition. Using instanteneous sound features, such as cepstral envelopes and cepstral derivatives, the temporal evolution of the sound is transcribed into a new representation as a sequence of spectral features. Applying information-theoretic sequence match...
متن کاملAuditory abstraction from spectro-temporal features to coding auditory entities.
The auditory system extracts behaviorally relevant information from acoustic stimuli. The average activity in auditory cortex is known to be sensitive to spectro-temporal patterns in sounds. However, it is not known whether the auditory cortex also processes more abstract features of sounds, which may be more behaviorally relevant than spectro-temporal patterns. Using recordings from three stat...
متن کاملDynamics of Spectro - Temporal Tuning in Primary Auditory Cortex of the Ferret
In this paper, we explore spectro-temporal tuning in primary auditory cortex following a sudden change in spectro-temporal content. We previously characterized the steady-state spectrotemporal tuning properties of cortical cells with respect to broadband sounds by presenting sounds with a spectro-temporal envelope of constant spectral density and angular frequency for several seconds. However, ...
متن کامل